Abstract. The paper provides a simpler method for proving a delicate
inequality that was used by Achlioptas and Naor to establish asymptotic
concentration for chromatic numbers of Erdős-Rényi random graphs.
The simplifications come from two new ideas. The first involves a sharpened
form of a piece of statistical folklore regarding goodness-of-fit tests
for two-way tables of Poisson counts under linear conditioning constraints.
The second idea takes the form of a new inequality that controls
the extreme tails of the distribution of a quadratic form in independent
Poisson random variables.



1. Introduction



Recently, Achlioptas and Naor (2005) established a most elegant result concerning
colorings of the Erdős-Rényi random graph, which has vertex set $V = \{1, 2, \dots, n\}$ and has
each of the $\binom{n}{2}$ possible edges included independently with probability $d/n$, for a fixed
parameter $d$. They showed that, as $n$ tends to infinity, the chromatic number concentrates
(with probability tending to one) on a set of two values, which they specified as explicit
functions of $d$. The main part of their argument used the "second moment method" (Alon
and Spencer 2000, Chapter 4) to establish existence of desired colorings with probability
bounded away from zero. Most of their paper was devoted to a delicate calculation
bounding the ratio of a second moment to the square of a first moment.

More precisely, A&N considered the quantity



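$$A_n(c) := \left(1-\frac{1}{k}\right)^{-2nc} \sum_{\ell \in H_k} \frac{(B!)^{2k}}{n!\,\prod_{i,j}\ell_{ij}!} \left(1-\frac{2}{k}+\sum_{i,j}\frac{\ell_{ij}^2}{n^2}\right)^{nc},$$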




where $H_k$ denotes the set of all $k \times k$ matrices with nonnegative entries for which each
row and column sum equals $B := n/k$. (With no loss of generality, A&N assumed that $n$
is an integer multiple of $k$.) They needed to show, for each fixed $k \ge 3$, that

$$A_n(c) = O(1) \quad\text{when } c < (k-1)\log(k-1). \tag{1}$$

In this paper we show how the A&N calculations can be simplified by using results
about conditioned Poisson distributions. More precisely, we show that the desired
behaviour of $A_n(c)$ follows from a sharpening of a conditional limit theorem due to
Haberman (1974) together with some elementary facts about the Poisson distribution.

In Section 2 we will establish some basic notation and record some elementary facts
about the Poisson distribution. In Section 3 we will explain how $A_n(c)$ can be bounded by
a conditional expectation of an exponential function of the classical goodness-of-fit statistic
for two-way tables. We will outline our proof of (1), starting from a $\chi^2$ heuristic that can
be sharpened (Section 4) into a rigorous proof that handles the contributions to $A_n(c)$ from
all except some extreme values of $\ell$. To control the contributions from the extreme $\ell$ we
will use an inequality (Lemma 2) that captures the large deviation behaviour of conditioned
Poissons. The proof of the Lemma (in Section 5) is actually the most delicate part of our
argument.

2. Facts about the Poisson distribution 
Many of the calculations in our paper involve the convex function

$$h(t) = (1+t)\log(1+t) - t \quad\text{for } -1 \le t, \tag{2}$$

which achieves its minimum value of zero at $t = 0$. Near its minimum, $h(t) = t^2/2 +
O(|t|^3)$. In fact, $h(t) = \tfrac12 t^2\psi(t)$ where $\psi$ is a decreasing function with $\psi(0) = 1$ and
$\psi'(0) = -1/3$. See Pollard (2001, page 312) for a simple derivation of these facts.
Define $\mathbb{N}_0 = \{0, 1, 2, \dots\}$, the set of all nonnegative integers.
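The function $h$ and its companion $\psi$ recur throughout the paper, so a small numerical sketch may help fix ideas. The names `h` and `psi` below are illustrative, and the series used near the origin is the standard Taylor expansion of $2h(t)/t^2$, introduced here only to avoid cancellation; it is not taken from the paper.

```python
import math


def h(t: float) -> float:
    """h(t) = (1 + t) log(1 + t) - t for t >= -1, with h(-1) = 1 by continuity."""
    if t < -1.0:
        raise ValueError("h is only defined for t >= -1")
    if t == -1.0:
        return 1.0
    return (1.0 + t) * math.log1p(t) - t


def psi(t: float) -> float:
    """psi(t) = 2 h(t) / t^2, with psi(0) = 1."""
    if abs(t) < 1e-4:
        # Taylor series 1 - t/3 + t^2/6 avoids cancellation near the origin.
        return 1.0 - t / 3.0 + t * t / 6.0
    return 2.0 * h(t) / (t * t)


if __name__ == "__main__":
    # Compare h with its quadratic approximation and watch psi decrease.
    for t in (-0.5, -0.1, 0.0, 0.1, 0.5, 2.0):
        print(f"t={t:5.2f}  h={h(t):.6f}  t^2/2={t * t / 2:.6f}  psi={psi(t):.6f}")
```

Printing a few values shows $h(t)$ tracking $t^2/2$ near zero and $\psi$ falling from values above 1 (for negative $t$) to values below 1 (for positive $t$).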

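Lemma 1. Suppose $W$ has a Poisson($\lambda$) distribution, with $\lambda \ge 1$.
(i) If $\ell = \lambda + \lambda u \in \mathbb{N}_0$ then

$$\sqrt{2\pi\lambda}\,\mathbb{P}\{W = \ell\} = \exp\left(-\lambda h(u) - \tfrac12\log(1+u) + O(1/\ell)\right)
= \exp\left(-\tfrac12\lambda u^2 + O(|u| + \lambda|u|^3)\right).$$

(ii) $\mathbb{P}\{W = \ell\} \le \exp(-\lambda h(u))$ for all $\ell = \lambda(1+u) \in \mathbb{N}_0$.
(iii) For all $w \ge 0$,

$$\mathbb{P}\{|W - \lambda| \ge \lambda w\} \le 2\exp(-\lambda h(w)) = 2\exp\left(-\tfrac12\lambda w^2 + O(\lambda|w|^3)\right).$$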

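Proof. By Stirling's formula,

$$\log(\ell!/\sqrt{2\pi}) = (\ell + \tfrac12)\log(\ell) - \ell + r_\ell \quad\text{where } \frac{1}{12\ell+1} \le r_\ell \le \frac{1}{12\ell}.$$

Thus

$$\log\left(\sqrt{2\pi\lambda}\,\mathbb{P}\{W = \ell\}\right) = -\lambda + \ell\log(\lambda) - \log(\ell!/\sqrt{2\pi}) + \tfrac12\log(\lambda)
= -\lambda h(u) - \tfrac12\log(1+u) + O(\ell^{-1}),$$

which gives (i).

For (ii), first note that $\mathbb{P}\{W = 0\} = e^{-\lambda} = \exp(-\lambda h(-1))$. For $\ell \ge 1$ we have

$$\log\left(\sqrt{2\pi}\,\mathbb{P}\{W = \ell\}\right) = -\lambda + \ell\log(\lambda) - (\ell + \tfrac12)\log(\ell) + \ell - r_\ell
\le -\lambda + \ell\log(\lambda) - \ell\log(\ell) + \ell = -\lambda h(u).$$

Inequality (iii) comes from two appeals to the usual trick with the moment generating
function $\mathbb{P}e^{tW} = \exp(\lambda(e^t - 1))$. For $w \ge 0$,

$$\mathbb{P}\{W \ge \lambda + \lambda w\} \le \inf_{t \ge 0} \mathbb{P}e^{t(W - \lambda - \lambda w)} = \inf_{t \ge 0} \exp\left(-t(\lambda + \lambda w) + \lambda(e^t - 1)\right).$$

The infimum is achieved at $t = \log(1+w)$, giving the bound $\exp(-\lambda h(w))$. Similarly

$$\mathbb{P}\{W \le \lambda - \lambda w\} \le \inf_{t \ge 0} \mathbb{P}e^{t(\lambda - \lambda w - W)} = \inf_{t \ge 0} \exp\left(t(\lambda - \lambda w) + \lambda(e^{-t} - 1)\right)$$

with the infimum achieved at $t = -\log(1-w)$ if $0 \le w < 1$ or as $t \to \infty$ if $w = 1$. The
resulting bound is $\exp(-\lambda h(-w))$, which is at most $\exp(-\lambda h(w))$ because $\psi$ is decreasing.
The inequality is trivial for $w > 1$. □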
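The three parts of Lemma 1 can be checked numerically for a particular $\lambda$. The sketch below is only a sanity check under stated assumptions, not part of the proof: the helper names are illustrative, the value $\lambda = 25$ is arbitrary, and the two-sided tail is computed by summing the probability mass function up to a finite cutoff.

```python
import math


def h(t: float) -> float:
    """h(t) = (1 + t) log(1 + t) - t, with h(-1) = 1."""
    return 1.0 if t <= -1.0 else (1.0 + t) * math.log1p(t) - t


def log_pmf(ell: int, lam: float) -> float:
    """log P{W = ell} for W distributed Poisson(lam)."""
    return -lam + ell * math.log(lam) - math.lgamma(ell + 1)


def stirling_remainder(ell: int) -> float:
    """r_ell in log(ell!/sqrt(2 pi)) = (ell + 1/2) log(ell) - ell + r_ell."""
    return (math.lgamma(ell + 1) - 0.5 * math.log(2 * math.pi)
            - (ell + 0.5) * math.log(ell) + ell)


def pointwise_slack(ell: int, lam: float) -> float:
    """log P{W = ell} + lam * h(u); part (ii) says this is never positive."""
    u = ell / lam - 1.0
    return log_pmf(ell, lam) + lam * h(u)


def two_sided_tail(lam: float, w: float, cutoff: int) -> float:
    """P{|W - lam| >= lam * w}, summing the pmf over 0..cutoff."""
    return sum(math.exp(log_pmf(ell, lam))
               for ell in range(cutoff + 1) if abs(ell - lam) >= lam * w)


if __name__ == "__main__":
    lam = 25.0  # arbitrary illustrative value
    print("largest slack in (ii):", max(pointwise_slack(e, lam) for e in range(200)))
    for w in (0.2, 0.5, 1.0, 1.5):
        print(w, two_sided_tail(lam, w, 500), 2 * math.exp(-lam * h(w)))
```

The slack in (ii) is zero at $\ell = 0$ and negative elsewhere, and each computed tail sits below the bound $2\exp(-\lambda h(w))$.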

3. Heuristics and an outline of the proof of (1) 

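We first show that $A_n(c)$ is almost a conditional expectation involving a set of
independent random variables, $Y = \{Y_{ij} : 1 \le i,j \le k\}$, each distributed Poisson($\lambda_{ij}$) with
$\lambda_{ij} = n/k^2$ for all $i, j$. For $\ell \in H_k$,

$$p(\ell) := \mathbb{P}\{Y = \ell\} = \prod_{i,j} \frac{e^{-\lambda_{ij}}\lambda_{ij}^{\ell_{ij}}}{\ell_{ij}!} = \frac{e^{-n}(n/k^2)^n}{\prod_{i,j}\ell_{ij}!}.$$

The standardized variables $X_{ij} := (Y_{ij} - \lambda_{ij})/\sqrt{\lambda_{ij}}$ are approximately independent
standard normals.

As we show in Section 4, the quantity

$$n^{(2k-1)/2}\,\mathbb{P}\{Y \in H_k\}$$

converges to a strictly positive constant as $n$ tends to infinity. Thus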

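$$p_2(\ell) := \mathbb{P}\{Y = \ell \mid Y \in H_k\} = \frac{p(\ell)}{\mathbb{P}\{Y \in H_k\}}
= \frac{(B!)^{2k}}{n!\,\prod_{i,j}\ell_{ij}!} \times \frac{n!\,e^{-n}(n/k^2)^n}{(B!)^{2k}\,\mathbb{P}\{Y \in H_k\}}.$$

By Stirling's approximation, the final fraction converges to a nonzero constant. The
quantity $A_n(c)$ is bounded by a constant multiple of

$$\sum_{\ell \in H_k} p_2(\ell)\left(1-\frac{1}{k}\right)^{-2nc}\left(1-\frac{2}{k}+\sum_{i,j}\frac{\ell_{ij}^2}{n^2}\right)^{nc}.$$

That is, for some constant $C_0$,

$$A_n(c) \le C_0\,\mathbb{P}_2\left(1-\frac{1}{k}\right)^{-2nc}\left(1-\frac{2}{k}+\sum_{i,j}\frac{Y_{ij}^2}{n^2}\right)^{nc}, \tag{3}$$

where $\mathbb{P}_2(\cdot)$ denotes expectations with respect to the conditional probability distribution
$\mathbb{P}\{\cdot \mid Y \in H_k\}$.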

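Note the similarity to the usual chi-squared goodness-of-fit statistic,

$$|X|^2 := \sum_{i,j} X_{ij}^2 = -n + \frac{k^2}{n}\sum_{i,j} Y_{ij}^2.$$

The quantity in (3) equals the $\mathbb{P}_2$ expectation of

$$\left(1-\frac{1}{k}\right)^{-2nc}\left(\left(1-\frac{1}{k}\right)^2 + \frac{|X|^2}{nk^2}\right)^{nc} \le \exp\left(\frac{c|X|^2}{(k-1)^2}\right).$$

Our task has become: for a fixed $J_k := c/(k-1)^2 < \rho_k := \log(k-1)/(k-1)$, show that

$$\mathbb{P}_2\exp(J_k|X|^2) = O(1) \quad\text{as } n \to \infty. \tag{4}$$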
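The reduction to the chi-squared statistic rests on two pieces of algebra: for a table with every row and column sum equal to $n/k$, the statistic $\sum_{i,j}(Y_{ij}-\lambda)^2/\lambda$ with $\lambda = n/k^2$ equals $-n + (k^2/n)\sum_{i,j} Y_{ij}^2$, and the A&N factor $(1-1/k)^{-2nc}(1-2/k+\sum_{i,j}Y_{ij}^2/n^2)^{nc}$ equals $(1+|X|^2/(n(k-1)^2))^{nc}$, which is at most $\exp(c|X|^2/(k-1)^2)$. The sketch below checks both on a hypothetical $3 \times 3$ table chosen only for illustration; the function names and the value $c = 1$ are assumptions, not taken from the paper.

```python
import math


def chi_squared(table, n):
    """|X|^2 = sum (Y_ij - lam)^2 / lam with lam = n / k^2."""
    k = len(table)
    lam = n / k ** 2
    return sum((y - lam) ** 2 / lam for row in table for y in row)


def log_an_term(table, n, c):
    """log of (1 - 1/k)^(-2nc) * (1 - 2/k + sum Y_ij^2 / n^2)^(nc)."""
    k = len(table)
    sum_sq = sum(y * y for row in table for y in row)
    return (-2 * n * c * math.log(1 - 1 / k)
            + n * c * math.log(1 - 2 / k + sum_sq / n ** 2))


if __name__ == "__main__":
    # Hypothetical table: every row and column sums to B = n / k = 10.
    table = [[5, 3, 2], [2, 5, 3], [3, 2, 5]]
    n, k, c = 30, 3, 1.0
    x2 = chi_squared(table, n)
    sum_sq = sum(y * y for row in table for y in row)
    print("two forms of |X|^2:", x2, -n + k ** 2 * sum_sq / n)
    print("log of A&N factor:", log_an_term(table, n, c))
    print("same via |X|^2:", n * c * math.log1p(x2 / (n * (k - 1) ** 2)))
    print("upper bound c|X|^2/(k-1)^2:", c * x2 / (k - 1) ** 2)
```

Working on the log scale keeps the computation stable when $nc$ is large.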

Under $\mathbb{P}_2$, the random vector $X$ has a limiting normal distribution $\mathcal{N}$ that concentrates
on a $(k-1)^2$-dimensional subspace of $\mathbb{R}^{k \times k}$. The random variable $|X|^2$ has an asymptotic
$\chi^2_R$ distribution with $R = (k-1)^2$. If we could assume that $|X|^2$ were exactly $\chi^2_R$-
distributed, we could bound the conditional expectation in (4) by a constant times

$$\int_0^\infty t^{R/2-1}\exp(ct/R - t/2)\,dt,$$

which would be finite for $c < R/2 = (k-1)^2/2$.

To make the argument rigorous we will need to consider the contributions from the
large $|Y_{ij} - n/k^2|$'s more carefully. As a special case of Theorem 3 in Section 4, we know
that for each fixed $\theta > 1$ there exists a $\delta = \delta_\theta > 0$ for which

$$\mathbb{P}_2\exp(J_k|X|^2)\{|X| \le \delta\sqrt{n}\} \le \theta\,\mathcal{N}\exp(\theta^2 J_k|x|^2). \tag{5}$$

The expectation with respect to the normal distribution $\mathcal{N}$ can be bounded as in the
previous paragraph because $|x|^2 \sim \chi^2_{(k-1)^2}$ under $\mathcal{N}$.
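The chi-squared calculation used in this heuristic can be made concrete. For a $\chi^2_R$ variable the expectation of $\exp(J\chi^2_R)$ equals $(1-2J)^{-R/2}$ when $J < 1/2$ and is infinite otherwise, a standard fact about the chi-squared moment generating function. The sketch below compares that closed form with a crude midpoint-rule integral and confirms that $\rho_k$ stays below $1/2$ for a range of $k$; the function names, the integration range, and the step count are illustrative assumptions.

```python
import math


def rho(k: int) -> float:
    """rho_k = log(k - 1) / (k - 1)."""
    return math.log(k - 1) / (k - 1)


def chi2_exp_moment(J: float, R: int) -> float:
    """E exp(J * chi2_R) = (1 - 2J)^(-R/2), finite only for J < 1/2."""
    if J >= 0.5:
        return math.inf
    return (1.0 - 2.0 * J) ** (-R / 2.0)


def chi2_exp_moment_numeric(J: float, R: int, upper: float = 400.0,
                            steps: int = 400_000) -> float:
    """Midpoint-rule approximation of the same expectation, for R >= 2."""
    norm = 2.0 ** (R / 2.0) * math.gamma(R / 2.0)
    width = upper / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * width
        total += t ** (R / 2.0 - 1.0) * math.exp(J * t - t / 2.0)
    return total * width / norm


if __name__ == "__main__":
    for k in (3, 4, 5, 6):
        R = (k - 1) ** 2
        J = 0.9 * rho(k)  # an arbitrary value below the threshold rho_k
        print(k, round(rho(k), 4), chi2_exp_moment(J, R))
    print(chi2_exp_moment(0.3, 4), chi2_exp_moment_numeric(0.3, 4))
```

Since $\log(x)/x$ never exceeds $1/e$, any $J_k < \rho_k$ lies safely inside the region where the chi-squared exponential moment is finite.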

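To control the contribution from $\{|X| > \delta\sqrt{n}\}$ it is notationally cleaner to work with
the variables $U_{ij} := (Y_{ij} - \lambda_{ij})/\lambda_{ij}$, that is, $U = kX/\sqrt{n}$. Write $\mathcal{U}$ for the set of all
$u$ in $\mathbb{R}^{k \times k}$ for which $\lambda_{ij}(1 + u_{ij}) \in \mathbb{N}_0$ for all $i,j$ and (because $Y$ is constrained to lie
in $H_k$),

$$-1 \le u_{ij} \le k-1 \quad\text{and}\quad \sum_i u_{ij} = 0 = \sum_j u_{ij}. \tag{6}$$

We need to bound

$$\mathbb{P}\exp\left(nJ_k|U|^2/k^2\right)\{|U| > k\delta\}/\mathbb{P}\{Y \in H_k\}
= O(n^{(2k-1)/2})\sum_{u \in \mathcal{U}}\{|u| > k\delta\}\,\mathbb{P}\{U = u\}\exp\left(nJ_k|u|^2/k^2\right).$$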

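From Lemma 1,

$$\mathbb{P}\{U = u\} \le \prod_{i,j}\exp\left(-nh(u_{ij})/k^2\right),$$

which leads us to the task of showing that

$$\sum_{u \in \mathcal{U}}\{|u| > k\delta\}\exp\left(-\frac{n}{k^2}\sum_{i,j}\left(h(u_{ij}) - J_k u_{ij}^2\right)\right) = O\left(n^{-(2k-1)/2}\right). \tag{7}$$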

Here we can make use of an inequality (proved in Section 5) that controls the exponent
in (7). Recall that $h(t) = (1+t)\log(1+t) - t$ and $\rho_k = \log(k-1)/(k-1)$.

Lemma 2. For each $u = (u_1, \dots, u_k) \in \mathbb{R}^k$ for which $\sum_j u_j = 0$ and $-1 \le u_j \le k-1$
for all $j$, we have $\sum_j h(u_j) \ge \rho_k \sum_j u_j^2$.
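A quick way to build intuition for Lemma 2 is to sample points from the constraint set and evaluate the gap $\sum_j h(u_j) - \rho_k\sum_j u_j^2$. The sketch below is a random spot check, not a proof: the sampling scheme (a uniformly distributed probability vector $p$ mapped to $u = kp - 1$), the sample sizes, and the function names are all assumptions made for illustration.

```python
import math
import random


def h(t: float) -> float:
    """h(t) = (1 + t) log(1 + t) - t, with h(-1) = 1."""
    return 1.0 if t <= -1.0 else (1.0 + t) * math.log1p(t) - t


def rho(k: int) -> float:
    return math.log(k - 1) / (k - 1)


def random_constrained_vector(k: int, rng: random.Random):
    """Sample u with sum(u) = 0 and -1 <= u_j <= k - 1 via u = k * p - 1."""
    e = [rng.expovariate(1.0) for _ in range(k)]
    total = sum(e)
    return [k * x / total - 1.0 for x in e]


def lemma2_gap(u) -> float:
    """sum_j h(u_j) - rho_k * sum_j u_j^2; Lemma 2 asserts this is >= 0."""
    k = len(u)
    return sum(h(x) for x in u) - rho(k) * sum(x * x for x in u)


if __name__ == "__main__":
    rng = random.Random(0)
    for k in range(3, 9):
        worst = min(lemma2_gap(random_constrained_vector(k, rng))
                    for _ in range(5000))
        corner = lemma2_gap([k - 1.0] + [-1.0] * (k - 1))
        print(k, worst, corner)
```

At the corner $u = (k-1, -1, \dots, -1)$ the gap works out to $k\log(k/(k-1))$, which shrinks as $k$ grows but stays positive.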

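When invoked for the sum over $j$ for each fixed $i$, the Lemma bounds (7) by

$$\sum_{u \in \mathcal{U}}\{|u| > k\delta\}\exp\left(-n\epsilon_0|u|^2\right) \quad\text{where } \epsilon_0 := (\rho_k - J_k)/k^2 > 0.$$

The set $\{u \in \mathcal{U} : 2^r k\delta \le |u| < 2^{r+1}k\delta\}$ has cardinality of order $O((n2^r)^{k^2})$. The last
sum is less than

$$\sum_{r \ge 0} O\left((n2^r)^{k^2}\right)\exp\left(-n\epsilon_0 4^r k^2\delta^2\right),$$

which decreases exponentially fast with $n$.
The bound asserted in (4) follows.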

4. Limit theory for conditioned Poisson distributions 

The main result in this Section is Theorem 3, which shows that the contributions to the
left-hand side of (4) from a large range of $X$ values can actually be bounded using the
$\chi^2$-approximation.

Suppose $Y = (Y_1, \dots, Y_q)$ is a vector of independent random variables with $Y_i$
distributed Poisson($\lambda_i$). Define

$$\lambda := (\lambda_1, \dots, \lambda_q) \quad\text{and}\quad D := \mathrm{diag}(\lambda_1^{1/2}, \dots, \lambda_q^{1/2}).$$

For the rest of this section assume that $\nu := \sum_i \lambda_i$ converges to infinity and that there
exists some fixed constant $\tau > 0$ for which

$$\nu \ge \max_i \lambda_i \ge \min_i \lambda_i \ge \tau\nu. \tag{8}$$

The various constants that appear throughout the section might depend on $\tau$.

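Suppose $v_1, \dots, v_s$ are fixed vectors in $\mathbb{Z}^q$ that are linearly independent, spanning a
subspace $\mathcal{L}$ of $\mathbb{R}^q$. The linear independence implies the existence of nonzero constants $C_1$
and $C_2$ for which

$$C_1 \max_\alpha |t_\alpha| \le \Big|\sum_\alpha t_\alpha v_\alpha\Big| \le C_2 \max_\alpha |t_\alpha| \quad\text{for all } t_\alpha \in \mathbb{R}. \tag{9}$$

We also assume that

$$\mathbb{Z}^q \cap (\lambda \oplus \mathcal{L}) \ne \emptyset. \tag{10}$$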

Under similar assumptions, Haberman (1974, Chapter 1) proved a central limit
theorem for the random vector $X := D^{-1}(Y - \lambda)$ conditional on the event $\{Y \in \lambda \oplus \mathcal{L}\}$.
The limit distribution $\mathcal{N}_\lambda$ is that of a $N(0, I_q)$ conditioned to lie in the $s$-dimensional
subspace $D^{-1}\mathcal{L}$. More precisely, $\mathcal{N}_\lambda$ has density $\phi(x) = (2\pi)^{-s/2}\exp(-|x|^2/2)$ with
respect to Lebesgue measure $m_\lambda$ on the subspace $D^{-1}\mathcal{L}$.

We will write $\mathbb{Q}(\cdot)$ to denote expectations under $\mathbb{P}(\cdot \mid Y \in \lambda \oplus \mathcal{L})$. That is, for the
conditional expectation of a function of $Y$,

$$\mathbb{Q}f(Y) = \frac{\mathbb{P}f(Y)\{Y \in \lambda \oplus \mathcal{L}\}}{\mathbb{P}\{Y \in \lambda \oplus \mathcal{L}\}}.$$

For the calculations leading to inequality (5), the $q \times 1$ vectors are more naturally written
as $k \times k$ tables. The vector of means becomes a table $\lambda = \{\lambda_{ij} : 1 \le i,j \le k\}$ with
$\lambda_{ij} = n/k^2$ for all $i,j$. The constraints on row and column sums can be written using the
$2k$ tables with ones in a single row or column, zeros elsewhere, but only $2k-1$ of those
tables are linearly independent. Thus $q = k^2$ and $s = k^2 - (2k-1) = (k-1)^2$ and $\nu = n$.
The $\mathbb{Q}$ in this Section corresponds to the $\mathbb{P}_2$ from Section 3.
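The dimension count above can be verified mechanically: flatten the $2k$ row and column indicator tables into vectors of length $k^2$ and compute the rank of the resulting matrix. The sketch below uses exact rational arithmetic from the standard library; the function names and the elimination routine are illustrative choices, not anything prescribed by the paper.

```python
from fractions import Fraction


def constraint_tables(k: int):
    """The 2k indicator tables (k rows, then k columns), flattened to length k*k."""
    tables = []
    for i in range(k):
        tables.append([1 if r == i else 0 for r in range(k) for c in range(k)])
    for j in range(k):
        tables.append([1 if c == j else 0 for r in range(k) for c in range(k)])
    return tables


def rank(rows) -> int:
    """Rank of an integer matrix by Gauss-Jordan elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for col in range(len(m[0])):
        pivot = next((r for r in range(rk, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for r in range(len(m)):
            if r != rk and m[r][col] != 0:
                factor = m[r][col] / m[rk][col]
                m[r] = [a - factor * b for a, b in zip(m[r], m[rk])]
        rk += 1
        if rk == len(m):
            break
    return rk


if __name__ == "__main__":
    for k in range(2, 7):
        r = rank(constraint_tables(k))
        print(k, "constraint rank:", r, " s = q - rank:", k * k - r)
```

The single linear dependency arises because the row tables and the column tables both sum to the all-ones table.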

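For each $w \in \mathbb{Z}^s$ define $z_w := \sum_{\alpha \le s} w_\alpha v_\alpha$, a point of $\mathbb{Z}^q$. The key idea in Haberman's
argument is that the space $\mathcal{L}$ is partitioned into disjoint boxes

$$B_w := \Big\{\sum_i t_i v_i : \lfloor t_i \rfloor = w_i\Big\} = z_w \oplus B_0 \quad\text{for } w \in \mathbb{Z}^s,$$

each containing the same number, $K_v$, of lattice points from $\mathbb{Z}^q$. Assumption (10) ensures
that $K_v > 0$.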

Theorem 3. Suppose $g$ is a uniformly continuous, increasing function. Then for each
$\theta > 1$ there exists a $\delta > 0$ and a subset $\mathcal{L}_\delta$ of $\mathcal{L}$ for which

(i) $\{x \in D^{-1}\mathcal{L} : |x| \le \delta\sqrt{\nu}\} \subset D^{-1}\mathcal{L}_\delta$
(ii) $\mathbb{Q}\exp(g(|X|^2))\{X \in D^{-1}\mathcal{L}_\delta\} \le \theta\,\mathcal{N}_\lambda\exp(g(\theta^2|x|^2))\{x \in D^{-1}\mathcal{L}_\delta/\theta\}$

The proof of the Theorem will be given at the end of this Section, as the culmination
of a sequence of lemmas based on the elementary facts from Section 2. We first show that
most of the contributions to the $\mathbb{P}_2$ and $\mathcal{N}_\lambda$ probabilities come from a large, bounded subset
of $\mathcal{L}$.

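Lemma 4. For each $\delta > 0$ define $W_\delta := \{w \in \mathbb{Z}^s : \max_\alpha |w_\alpha| \le \delta\nu\}$ and $\mathcal{L}_\delta :=
\cup_{w \in W_\delta} B_w$. Write $\mathcal{L}_\delta^c$ for $\mathcal{L} \setminus \mathcal{L}_\delta$. There exists a constant $C_\delta > 0$ for which

$$\mathbb{P}\{Y \in \lambda \oplus \mathcal{L}_\delta^c\} + \mathcal{N}_\lambda(D^{-1}\mathcal{L}_\delta^c) = O(e^{-C_\delta\nu}).$$

Proof. If $y \in \lambda \oplus (\mathcal{L} \setminus \mathcal{L}_\delta)$ then $y - \lambda \in z_w \oplus B_0$ for some $w$ with $\max_\alpha |w_\alpha| > \delta\nu$, which
implies

$$\sqrt{q}\max_i |y_i - \lambda_i| \ge |y - \lambda| \ge |z_w| - \mathrm{diam}(B_0) \ge C_1\delta\nu - C_4.$$

Define $\delta_0 := C_1\delta/(2\sqrt{q})$. When $\nu$ is large enough we have $(C_1\delta\nu - C_4)/\sqrt{q} \ge \delta_0\nu \ge
\delta_0\max_i \lambda_i$, so that

$$\mathbb{P}\{Y \in \lambda \oplus \mathcal{L}_\delta^c\} \le \sum_i \mathbb{P}\{|Y_i - \lambda_i| > \delta_0\lambda_i\}.$$

Invoke Lemma 1 to bound the $i$th summand by $2\exp(-\lambda_i\delta_0^2/2 + O(\delta_0^3\lambda_i))$. With a
possible decrease in $\delta_0$ we can ensure that the $\lambda_i\delta_0^2/2$ is at least twice the other contribution
to the exponent.

Similarly, if $x \in D^{-1}(\mathcal{L} \setminus \mathcal{L}_\delta)$ and $\nu$ is large enough then $|x| \ge \delta_0\sqrt{\nu}$ and the
contribution from $\mathcal{N}_\lambda$ is bounded by a sum of tail probabilities for the standard normal. □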

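Next we use Lemma 1 to get good pointwise approximations for $\mathbb{P}\{Y = \ell\}$ when $|\ell - \lambda|$
is not too large.

Lemma 5. For each $\theta > 1$ there exists a $\delta > 0$ such that, for all $\ell = \lambda + Dx$ in $\mathbb{N}_0^q$ for
which $\max_i \lambda_i^{-1}|\ell_i - \lambda_i| \le \delta$,

$$\theta^{-1}\phi(\theta x) \le \nu^{q/2}\,\mathbb{P}\{Y = \ell\}/\gamma(\lambda) \le \theta\,\phi(x/\theta)$$

where $\gamma(\lambda) := (2\pi)^{s/2}\prod_i (2\pi\lambda_i/\nu)^{-1/2}$, a factor that stays bounded away from zero and
infinity as $\nu \to \infty$.

Proof. From Lemma 1,

$$\sqrt{2\pi\lambda_i}\,\mathbb{P}\{Y_i = \ell_i\} = \exp\left(-\tfrac12 x_i^2 + r_i\right)$$

where, for some constant $C_3$,

$$|r_i| \le C_3(|x_i| + |x_i|^3)/\sqrt{\nu} \le C_3\delta(1 + x_i^2).$$

The asserted inequalities follow if $\delta$ is small enough. □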

Next we sum over the pointwise approximations to get bounds for the probability that $Y$
lies in one of the boxes that partition $\lambda \oplus \mathcal{L}$. The sum for the box $\lambda \oplus B_w$ will run over the
lattice points of the form $\lambda + Dx$ with $x$ in the set

$$\mathcal{X}_w := \{x \in D^{-1}B_w : \lambda + Dx \in \mathbb{N}_0^q\}.$$

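Lemma 6. For each $\theta > 1$ there exists a $\delta > 0$ such that, for all $w$ in $W_\delta$ and $\nu$ large
enough,

$$\theta^{-1}\mathcal{N}_\lambda(D^{-1}B_w\theta) \le \nu^{(q-s)/2}\,\mathbb{P}\{Y \in \lambda \oplus B_w\}/\beta(\lambda) \le \theta\,\mathcal{N}_\lambda(D^{-1}B_w/\theta)$$

where $\beta(\lambda)$ is a factor that stays bounded away from zero and infinity as $\nu \to \infty$.

Proof. As the proofs for the two inequalities are similar, we consider only the upper bound.
Define $x_w := D^{-1}z_w$. By inequality (9) we have $|z_w| \le C_2\delta\nu$ and hence $|x_w| \le
C_4\delta\sqrt{\nu}$ for some constant $C_4$. Similarly, for each $y = \lambda + Dx$ in $\lambda \oplus B_w$ we have
$|y - \lambda - z_w|$ bounded by a constant, which implies $|x - x_w| \le C_5/\sqrt{\nu}$ and hence

$$\big|\,|x|^2 - |x_w|^2\,\big| \le \epsilon_0 := C_5^2/\nu + 2(C_5/\sqrt{\nu})\,C_4\delta\sqrt{\nu}.$$

It follows that for each $\epsilon > 0$ and $\sigma$ close enough to 1,

$$\sup\{|\phi(x/\sigma)/\phi(x_w/\sigma) - 1| : x \in D^{-1}B_w\} \le \epsilon$$

if $\nu$ is large enough and $\delta$ is small enough.

Taking $\sigma$ equal to the $\theta$ from Lemma 5 we then have

$$\mathbb{P}\{Y \in \lambda \oplus B_w\} = \sum_{x \in \mathcal{X}_w}\mathbb{P}\{Y = \lambda + Dx\}
\le \theta\gamma(\lambda)\nu^{-q/2}\sum_{x \in \mathcal{X}_w}\phi(x/\theta)
\le \theta\gamma(\lambda)\nu^{-q/2}K_v(1+\epsilon)\,\phi(x_w/\theta).$$

Similarly,

$$\mathcal{N}_\lambda(D^{-1}B_w/\theta) = \int\{\theta t \in D^{-1}B_w\}\,\phi(t)\,m_\lambda(dt)
= \theta^{-s}\int\{x \in D^{-1}B_w\}\,\phi(x/\theta)\,m_\lambda(dx)
\ge \theta^{-s}(1-\epsilon)\,\phi(x_w/\theta)\,m_\lambda(D^{-1}B_w).$$

The invariance properties of Lebesgue measure imply $m_\lambda(D^{-1}B_w) = m_\lambda(D^{-1}B_0)$ and
the existence of some function $\mu(\lambda)$ that stays bounded away from zero and infinity as
$\nu$ tends to infinity, for which $m_\lambda(D^{-1}B_0) = \nu^{-s/2}\mu(\lambda)$. Thus

$$\mathbb{P}\{Y \in \lambda \oplus B_w\} \le \theta^{s+1}\,\frac{1+\epsilon}{1-\epsilon}\,\nu^{-(q-s)/2}\,\frac{K_v\gamma(\lambda)}{\mu(\lambda)}\,\mathcal{N}_\lambda(D^{-1}B_w/\theta).$$

Choose $\epsilon$ small enough and replace $\theta$ by a value closer to 1 to get the upper half of the
asserted inequality, with $\beta(\lambda) = K_v\gamma(\lambda)/\mu(\lambda)$. □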

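Corollary 7. $\mathbb{P}\{Y \in \lambda \oplus \mathcal{L}\} = \nu^{-(q-s)/2}\left(\beta(\lambda) + o(1)\right)$

Proof. From Lemmas 4 and 6, for each $\theta > 1$,

$$\mathbb{P}\{Y \in \lambda \oplus \mathcal{L}\} = \mathbb{P}\{Y \in \lambda \oplus \mathcal{L}_\delta^c\} + \sum_{w \in W_\delta}\mathbb{P}\{Y \in \lambda \oplus B_w\}$$

$$\le O(e^{-C_\delta\nu}) + \theta\beta(\lambda)\nu^{-(q-s)/2}\sum_{w \in W_\delta}\mathcal{N}_\lambda(D^{-1}B_w/\theta)$$

$$\le \theta\nu^{-(q-s)/2}\left(\beta(\lambda) + o(1)\right).$$

The argument for the lower bound is similar. □

Corollary 8. For all $\nu$ large enough,

$$\theta^{-1}\mathcal{N}_\lambda(D^{-1}B_w\theta) \le \mathbb{Q}\{Y \in \lambda \oplus B_w\} \le \theta\,\mathcal{N}_\lambda(D^{-1}B_w/\theta)$$

for all $w \in W_\delta$.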

We now have all the facts needed to prove Theorem 3. The argument is a slight
modification of the method used to prove Lemma 6. Start with the $\delta$ and $\mathcal{L}_\delta$ from that Lemma.
Assertion (i), modulo an unimportant constant, was established at the start of the proof of
the Lemma.

Define $f(x) := \exp(g(|x|^2))$. From the proof of the Lemma we know that $\big|\,|x|^2 -
|x_w|^2\,\big| \le \epsilon_0$. By uniform continuity of $g$, if $\delta$ is small enough we then have

$$\left|g(|x|^2/\sigma^2) - g(|x_w|^2/\sigma^2)\right| \le \epsilon \quad\text{all } x \in D^{-1}B_w, \text{ all } \sigma \approx 1$$

and hence

$$e^{-\epsilon}f(x_w/\sigma) \le f(x/\sigma) \le e^{\epsilon}f(x_w/\sigma) \quad\text{all } x \in D^{-1}B_w, \text{ all } \sigma \approx 1.$$

Use the bounds on $f$ on $D^{-1}B_w$ to deduce that

$$\mathbb{Q}f(X)\{Y \in \lambda \oplus B_w\} \le e^{\epsilon}f(x_w)\,\mathbb{Q}\{Y \in \lambda \oplus B_w\}$$

$$\le e^{\epsilon}f(\theta x_w)\,\theta\,\mathcal{N}_\lambda(D^{-1}B_w/\theta) \quad\text{as } g \text{ is increasing}$$

$$\le e^{2\epsilon}\,\theta\,\mathcal{N}_\lambda f(\theta x)\{x \in D^{-1}B_w/\theta\}.$$

Sum over $w$ in $W_\delta$ to complete the argument.

5. Proof of Lemma 2 
At a key step in the argument we will need the inequality

$$\psi(t) \ge 2\log(1+2t)/(1+2t) \quad\text{for all } t \ge 0, \tag{11}$$

for which, unfortunately, we have no direct analytic proof. However, the assertion is
trivially true near the origin because the lower bound tends to zero as $t$ tends to zero. For
large $t$ the ratio of $\psi(t)$ to the lower bound tends to 2. For intermediate values we have
only a proof based on an analytic bound on derivatives together with numerical calculation
on a suitably fine grid. It would be satisfying to have a completely analytic proof for (11).
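The numerical side of inequality (11) is easy to reproduce in outline. The sketch below evaluates the gap $\psi(t) - 2\log(1+2t)/(1+2t)$ on an evenly spaced grid; it is only an illustration of the grid calculation, since it omits the derivative bound that would be needed to cover the points between grid nodes. The grid range, the step count, and the function names are assumptions.

```python
import math


def h(t: float) -> float:
    return (1.0 + t) * math.log1p(t) - t


def psi(t: float) -> float:
    """psi(t) = 2 h(t) / t^2, with the Taylor series used near zero."""
    if abs(t) < 1e-4:
        return 1.0 - t / 3.0 + t * t / 6.0
    return 2.0 * h(t) / (t * t)


def lower_bound(t: float) -> float:
    """Right-hand side of (11)."""
    return 2.0 * math.log1p(2.0 * t) / (1.0 + 2.0 * t)


def gap(t: float) -> float:
    return psi(t) - lower_bound(t)


def min_gap(t_max: float, steps: int) -> float:
    """Smallest gap over the grid 0, t_max/steps, ..., t_max."""
    return min(gap(t_max * i / steps) for i in range(steps + 1))


if __name__ == "__main__":
    print("min gap on [0, 100]:", min_gap(100.0, 100_000))
    for t in (0.5, 1.0, 2.0, 5.0, 50.0):
        print(t, psi(t), lower_bound(t), psi(t) / lower_bound(t))
```

On this grid the gap is smallest near $t = 2$, where it is of order $0.004$, which suggests why a purely analytic argument is delicate.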
Define $g_k(s) := h(s) - \rho_k s^2$. We need to show that the function $G_k(u) := \sum_{j \le k} g_k(u_j)$
is nonnegative on the constraint set. Suppose the minimum is achieved at $t = (t_1, \dots, t_k)$.
Without loss of generality, we may suppose $-1 \le t_1 \le t_2 \le \dots \le t_k \le k-1$. We cannot
have $t_1 = -1$ because $h'(-1) = -\infty$. Indeed,

$$g_k(t_1 + \epsilon) + g_k(t_k - \epsilon) - g_k(t_1) - g_k(t_k) = \epsilon\log\epsilon + O(\epsilon),$$

which would be negative for small $\epsilon > 0$. It then follows that $t_k < k-1$ for otherwise the
constraint $\sum_j t_j = 0$ would force $t_j = -1$ for $j < k$.

Use Lagrange multipliers (or argue directly regarding the first order effects of
perturbations $\epsilon$ with $\sum_j \epsilon_j = 0$) to deduce existence of some constant $\theta$ for which $g_k'(t_j) = \theta$ for
all $j$.

Note that $g_k'(s) = \log(1+s) - 2\rho_k s$ is concave (because $g_k''(s)$ is decreasing) with
$g_k'(0) = 0$ and $g_k''(0) = 1 - 2\rho_k > 0$. It follows that $\theta \le 0$ and that there are numbers
$-1 < a_\theta \le 0 \le b_\theta < k-1$ with $g_k'(a_\theta) = \theta = g_k'(b_\theta)$ such that $t_j$ equals $a_\theta$ for $j \le
k-r$ and $b_\theta$ otherwise. That is $(k-r)a_\theta + rb_\theta = 0$ and $G_k(t) = r\,g_k(b_\theta) + (k-r)\,g_k(a_\theta)$.
Thus it suffices for us to show that the functions

$$M_{r,k}(b) := r\,g_k(b) + (k-r)\,g_k(-rb/(k-r)) \quad\text{for } 0 \le b \le (k-r)/r$$

are nonnegative for $r = 1, 2, \dots, k-2$.

For $r \ge 2$ and $0 \le b \le (k-2)/2$, inequality (11) shows that $g_k(b)$ is nonnegative:

»,«^.0-.«)..('5i^-'=^).o 

because $b \mapsto \log(1+2b)/(1+2b)$ is a decreasing function. The function $M_{r,k}(b)$ is then
a sum of nonnegative functions on $[0, (k-r)/r]$.

It remains only to consider the case where $r$ equals 1. To simplify notation, write $k_1$
for $k-1$ and abbreviate $M_{1,k}$ to $M_k$. That is,

$$M_k(b) = h(b) + k_1 h(-b/k_1) - \rho_k\left(b^2 + k_1(b/k_1)^2\right)
= (1+b)\log(1+b) + (k_1 - b)\log(1 - b/k_1) - k\rho_k b^2/k_1,$$

whence

$$M_k'(b) = \log\left(\frac{1+b}{1 - b/k_1}\right) - \frac{2k\rho_k b}{k_1},$$

$$M_k''(b) = \frac{k}{(1+b)(k_1 - b)} - \frac{2k\rho_k}{k_1}.$$

Notice that $M_k''(b) \ge 0$ except on an interval $I_k := (b_k, b_k')$ in which the inequality $2(1+
b)(k_1 - b) > k_1/\rho_k$ holds.

For $k = 3$ or $4$ the interval $I_k$ is empty. The functions $M_3$ and $M_4$ are convex. They
achieve their minima of zero at $b = 0$ because $M_k'(0) = 0$.

For $k \ge 5$, the interval $I_k$ is nonempty. The derivative $M_k'(b)$ achieves a local maximum
at $b = b_k$ and a local minimum at $b_k'$. For $k = 5$ we have $b_5' \approx 2.19$ and
$M_5'(b_5') \approx 0.055$. Thus $M_5$ is an increasing function on $[0,4]$, achieving its minimum
value of zero at $b = 0$.

Figure 1. Plots of $M_k(b)$ for various values of $k$. The vertical lines
mark off the intervals $I_k$ where the functions are concave.



For $k \ge 6$ a more delicate analysis is required. The function $M_k$ is concave on the
segment $I_k$ and convex on each of the segments $[0, b_k]$ and $[b_k', k_1]$. The global minimum
is achieved either at $b = 0$, with $M_k(0) = 0$, or at the local minimum $b^* \in (b_k', k_1)$
where $M_k'(b^*) = 0$ and $M_k''(b^*) > 0$. From the change in sign of the derivative,

$$M_k'(k_1 - 1) = 2\log(k_1)/k_1^2 > 0 \quad\text{for all } k,$$

$$M_k'(k_1 - 2) = \frac{2(k_1 + 2)\log(k_1)}{k_1^2} + \log\left(\frac{k_1 - 1}{2k_1}\right) < 0 \quad\text{for } k \ge 6,$$

we deduce that $k_1 - 2 < b^* < k_1 - 1$. The convexity of $M_k$ on $[b_k', k_1]$ then gives a linear
lower bound,

$$M_k(b^*) \ge M_k(k_1 - 1) + (b^* - k_1 + 1)M_k'(k_1 - 1)
\ge \frac{k_1 - 1}{k_1^2}\log(k_1) - \frac{2\log(k_1)}{k_1^2}
> 0 \quad\text{for } k > 4.$$

It follows that $M_k$ is nonnegative also for $k \ge 6$.
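The case analysis for $M_k$ can be cross-checked numerically. The sketch below evaluates $M_k$ and its derivative from the closed forms $M_k(b) = (1+b)\log(1+b) + (k_1-b)\log(1-b/k_1) - k\rho_k b^2/k_1$ and $M_k'(b) = \log((1+b)/(1-b/k_1)) - 2k\rho_k b/k_1$, solves the quadratic $2(1+b)(k_1-b) = k_1/\rho_k$ for the endpoints of $I_k$, and takes a grid minimum. It is a spot check rather than a proof; the function names, the grid size, and the range of $k$ are assumptions.

```python
import math


def rho(k: int) -> float:
    return math.log(k - 1) / (k - 1)


def M(k: int, b: float) -> float:
    """M_k(b) for 0 <= b < k - 1."""
    k1 = k - 1
    return ((1 + b) * math.log1p(b) + (k1 - b) * math.log1p(-b / k1)
            - k * rho(k) * b * b / k1)


def M_prime(k: int, b: float) -> float:
    k1 = k - 1
    return math.log((1 + b) / (1 - b / k1)) - 2 * k * rho(k) * b / k1


def concavity_interval(k: int):
    """Roots (b_k, b_k') of 2(1 + b)(k1 - b) = k1 / rho_k, or None if none exist."""
    k1 = k - 1
    # (1 + b)(k1 - b) = k1/(2 rho)  <=>  b^2 - (k1 - 1) b + k1/(2 rho) - k1 = 0
    const = k1 / (2 * rho(k)) - k1
    disc = (k1 - 1) ** 2 - 4 * const
    if disc <= 0:
        return None
    root = math.sqrt(disc)
    return ((k1 - 1 - root) / 2, (k1 - 1 + root) / 2)


def grid_minimum(k: int, steps: int = 20_000) -> float:
    """Minimum of M_k over an even grid on [0, k - 1)."""
    k1 = k - 1
    return min(M(k, k1 * i / steps) for i in range(steps))


if __name__ == "__main__":
    for k in range(3, 11):
        print(k, concavity_interval(k), grid_minimum(k),
              M_prime(k, k - 3), M_prime(k, k - 2))
```

For $k = 6$ the derivative changes sign between $b = k_1 - 2 = 3$ and $b = k_1 - 1 = 4$, matching the location of the local minimum $b^*$ described above.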